Skip to main content

Data Sources

Current Data Providers

Organizations are encouraged to become data providers themselves. For more information, please refer to the relevant section below.

AWI Electronic Publication Information Center
harvested entity:
name: AWI Electronic Publication Information Center
contact: epic@awi.de
provider: https://ror.org/032e6b942
providerType: publication database

data_sources:
- endpoint: https://epic.awi.de/cgi/oai2
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
ignore_deleted: True
CISPA Research Information System
harvested entity:
name: CISPA Research Information System
contact: publications@cispa.saarland
provider: https://ror.org/02njgxr09
providerType: publication database

data_sources:
- endpoint: https://publications.cispa.saarland/cgi/oai2
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
ignore_deleted: True
DESY Publications Database
harvested entity:
name: DESY Publications Database
contact: l.pubdb@desy.de
provider: https://ror.org/01js2sh04
providerType: publication database

data_sources:
- endpoint: https://bib-pubdb1.desy.de/oai2d
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
- marcxml
ignore_deleted: True
DKFZ Library
harvested entity:
name: DKFZ Library
contact: zb.invenio@dkfz-heidelberg.de
provider: https://ror.org/04cdgtt98
providerType: publication database

data_sources:
- endpoint: https://inrepo02.dkfz.de/oai2d
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
- marcxml
ignore_deleted: True
DLR Electronic Library
harvested entity:
name: DLR Electronic Library (elib)
contact:
- elib@dlr.de
- elib.support@dlr.de
provider: https://ror.org/04bwf3e34
providerType: publication database

data_sources:
- endpoint: https://elib.dlr.de/cgi/oai2
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
ignore_deleted: True
DZNEPUB
harvested entity:
name: DZNEPUB
contact: j2-admin@dzne.de
provider: https://ror.org/043j0f473
providerType: publication database

data_sources:
- endpoint: https://pub.dzne.de/oai2d
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
- marcxml
ignore_deleted: True
DataCite
harvested entity:
name: DataCite
contact: support@datacite.org
provider: https://ror.org/04wxnsj81
providerType: research information system
see_also:
https://datacite.org/
https://support.datacite.org/reference/introduction

data_sources:
- endpoint: https://api.datacite.org/graphql
harvester type: "datacite"
parameters:
ids:
- https://ror.org/032e6b942
- https://ror.org/02njgxr09
- https://ror.org/01js2sh04
- https://ror.org/04cdgtt98
- https://ror.org/04bwf3e34
- https://ror.org/043j0f473
- https://ror.org/02nv7yv05
- https://ror.org/02h2x0161
- https://ror.org/04z8jg394
- https://ror.org/02k8cbn47
- https://ror.org/03wy6tp26
- https://ror.org/02yvsmh51
- https://ror.org/0281dp749
- https://ror.org/01syejz95
- https://ror.org/04v4h0v24
- https://ror.org/02aj13c28
- https://ror.org/03qjp1d79
- https://ror.org/034rhsb33
- https://ror.org/00cfam450
- https://ror.org/01zy2cs03
- https://ror.org/03d0p2685
- https://ror.org/04t3en479
- https://ror.org/04p5ggc03
- https://ror.org/000h6jb29
id-type: ror

- endpoint: https://api.datacite.org/dois
harvester type: "datacite"
parameters:
ids: [dois found via graphql endpoint]
id-type: doi

GFZpublic
harvested entity:
name: GFZpublic
contact:
provider: https://ror.org/04z8jg394
providerType: publication database

data_sources:
- endpoint: https://gfzpublic.gfz-potsdam.de/oai/provider
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
ignore_deleted: True

GSI Repository
harvested entity:
name: GSI Repository
contact:
- gsilibrary@gsi.de
- invenio-service@gsi.de
provider: https://ror.org/02k8cbn47
providerType: publication database

data_sources:
- endpoint: https://repository.gsi.de/oai2d
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
- marcxml
ignore_deleted: True

HIFIS and Helmholtz Events
harvested entity:
name: HIFIS and Helmholtz Events
contact: support@hifis.net
description: A Helmholtz-hosted Events Management service for everyone within Helmholtz and their partners, based on CERN's Indico.
provider: https://ror.org/01js2sh04
providerType: event database
see_also:
- https://events.hifis.net/
- https://helmholtz.cloud/services/?serviceID=ec78dddd-f44b-4062-a1a1-ba9c0ddaa70b

data_sources:
- endpoint: https://events.hifis.net
harvester type: "indico"
parameters:
categories: [ 150,162,12,161 ]
token: 'indp_3SicijJvemsUSbm18kl9dEDk2OmAPXK1h2r4OtJQAF'
HZB Publication Server
harvested entity:
name: HZB Publication Server
contact: library@helmholtz-berlin.de
provider: https://ror.org/02aj13c28
providerType: publication database

data_sources:
- endpoint: https://www.helmholtz-berlin.de/pubbin/oai
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
ignore_deleted: True
HZI OpenRepository
harvested entity:
name: HZI OpenRepository
contact: bibliothek@helmholtz-hzi.de
provider: https://ror.org/03d0p2685
providerType: publication database

data_sources:
- endpoint: https://repository.helmholtz-hzi.de/oai/request
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
- marc
ignore_deleted: True

JuSER
harvested entity:
name: JuSER
contact: juser@fz-juelich.de
provider: https://ror.org/02nv7yv05
providerType: publication database

data_sources:
- endpoint: https://juser.fz-juelich.de/oai2d
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
- marcxml
ignore_deleted: True
Jülich Data
harvested entity:
name: Jülich Data
contact: forschungsdaten@fz-juelich.de
provider: https://ror.org/02nv7yv05
providerType: data repository

data_sources:
- endpoint: https://data.fz-juelich.de/sitemap.xml
harvester type: "sitemap"
parameters:
match_pattern: '.*/dataset.xhtml.*'
export_api:
- replace: [ "dataset.xhtml?", "api/datasets/export?exporter=schema.org&" ]

KIT Indico
harvested entity:
name: KIT Indico
contact: indico-support@scc.kit.edu
provider: https://ror.org/04t3en479
providerType: event database
see_also:
- https://indico.kit.edu/

data_sources:
- endpoint: https://indico.scc.kit.edu
harvester type: "indico"
KITopen
harvested entity:
name: KITopen
# hier den richtigen Ansprechpartner erfragen
contact: infodesk@bibliothek.kit.edu
provider: https://ror.org/04t3en479
providerType: publication database

data_sources:
- endpoint: https://dbkit.bibliothek.kit.edu/oai/eva/
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
ignore_deleted: True
MDC Repository
harvested entity:
name: MDC Repository
contact: eprints@mdc-berlin.de
provider: https://ror.org/04p5ggc03
providerType: publication database

data_sources:
- endpoint: https://edoc.mdc-berlin.de/cgi/oai2
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
ignore_deleted: True

OceanRep GEOMAR
harvested entity:
name: OceanRep GEOMAR
contact: bibliotheksleitung@geomar.de
provider: https://ror.org/02h2x0161
providerType: publication database

data_sources:
- endpoint: https://oceanrep.geomar.de/cgi/oai2
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
ignore_deleted: True
PANGAEA
harvested entity:
name: PANGAEA
contact: https://www.pangaea.de/contact/
provider: https://ror.org/032e6b942
providerType: data repository

data_sources:
- endpoint: https://doi.pangaea.de/sitemap.xml
harvester type: "sitemap"

Publikationen aus dem HZDR
harvested entity:
name: Publikationen aus dem Helmholtz-Zentrum Dresden-Rossendorf
contact: s.schmittAthzdr.de
provider: https://ror.org/01zy2cs03
providerType: publication database

data_sources:
- endpoint: https://www.hzdr.de/publications/OAI-PMH
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
ignore_deleted: True
Publikationsserver des Helmholtz Zentrums München
harvested entity:
name: Publikationsserver des Helmholtz Zentrums München (PuSH)
contact:
- astrid.uerlichs@helmholtz-muenchen.de
- markus.hagemann@helmholtz-muenchen.de
provider: https://ror.org/00cfam450
providerType: publication database

data_sources:
- endpoint: https://push-zb.helmholtz-munich.de/oai2/
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
ignore_deleted: True

Research Organization Registry (ROR)
harvested entity:
name: Research Organization Registry (ROR)
contact: support@ror.org
providerType: organization metadata database

data_sources:
- endpoint: https://api.ror.org/v2/organizations/
harvester type: "ror"
parameters:
ids:
- https://ror.org/032e6b942
- https://ror.org/02njgxr09
- https://ror.org/01js2sh04
- https://ror.org/04cdgtt98
- https://ror.org/04bwf3e34
- https://ror.org/043j0f473
- https://ror.org/02nv7yv05
- https://ror.org/02h2x0161
- https://ror.org/04z8jg394
- https://ror.org/02k8cbn47
- https://ror.org/03wy6tp26
- https://ror.org/02yvsmh51
- https://ror.org/0281dp749
- https://ror.org/01syejz95
- https://ror.org/04v4h0v24
- https://ror.org/02aj13c28
- https://ror.org/03qjp1d79
- https://ror.org/034rhsb33
- https://ror.org/00cfam450
- https://ror.org/01zy2cs03
- https://ror.org/03d0p2685
- https://ror.org/04t3en479
- https://ror.org/04p5ggc03
- https://ror.org/000h6jb29
id-type: ror
Rossendorf Data Repository
harvested entity:
name: Rossendorf Data Repository
contact: https://rodare.hzdr.de/support
provider: https://ror.org/01zy2cs03
providerType: publication database

data_sources:
- endpoint: https://rodare.hzdr.de/sitemap.xml
harvester type: "sitemap"
parameters:
match_pattern: '.*/record/\d'
UFZ Publication Repository
harvested entity:
name: UFZ Publication Repository
contact: publikationen@ufz.de
provider: https://ror.org/000h6jb29
providerType: publication database

data_sources:
- endpoint: https://web.app.ufz.de/publikationsdatenbank/oai-pmh/
harvester type: "oai"
parameters:
metadataPrefix:
- oai_dc
ignore_deleted: True

Data Provider Manifests

Administrative Information

This section provides an overview of all data sources integrated into the knowledge graph. The configuration files are organized by their respective data providers and consist of two main sections.

The first section follows the format:

harvested entity:
name:
contact:
provider:
providerType:

It contains manually curated information, primarily sourced from the data provider’s website (if available).

If the name field does not include at least an abbreviation of the provider, one is added manually. The provider field is populated with a ROR ID corresponding to the associated research organization. The providerType field uses a controlled vocabulary and may take one of the following values:

  • publication database
  • event database
  • data repository
  • organization metadata database
  • research information system

Technical Information

The second section contains technical details about the endpoints harvested for the knowledge graph:

data_sources:
- endpoint:
harvester type:
parameters:
  • specifies the URL from which data is retrieved.
  • harvester type indicates the harvester used within the data pipeline.
  • parameters define technical settings. For further details on their usage and rationale, refer to the corresponding codebase in the gitlab repository can be checked.